Content Analysis by the Crowd: Assessing the Usability of Crowdsourcing for Coding Latent Constructs
Abstract
Crowdsourcing platforms are commonly used for research in the humanities, social sciences and informatics, including the use of crowdworkers to annotate textual material or visuals. Utilizing two empirical studies, this article systematically assesses the potential of crowdcoding for less manifest contents of news texts, here focusing on political actor evaluations. Specifically, Study 1 compares the reliability and validity of crowdcoded data to that of manual content analyses; Study 2 proceeds to investigate the effects of material presentation, different types of coding instructions and answer option formats on data quality. We find that the performance of the crowd recommends crowdcoded data as a reliable and valid alternative to manually coded data, also for less manifest contents. While scale manipulations affected the results, minor modifications of the coding instructions or material presentation did not significantly influence data quality. In sum, crowdcoding appears a robust instrument to collect quantitative content data.
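The comparison described for Study 1 can be illustrated with a minimal sketch. The abstract does not say how crowd judgments were aggregated or which reliability coefficient was used, so the majority vote, Cohen's kappa, the three-point evaluation scale (-1, 0, 1) and all data below are assumptions chosen purely for illustration, not the authors' procedure.

```python
from collections import Counter


def majority_vote(codes):
    """Return the most frequent code; ties resolve to the smallest value."""
    counts = Counter(codes)
    top = max(counts.values())
    return min(code for code, n in counts.items() if n == top)


def cohen_kappa(a, b):
    """Cohen's kappa for two equally long sequences of nominal codes."""
    if len(a) != len(b) or not a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(a)
    # Share of units on which both sources assign the same code.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Agreement expected by chance, from each source's marginal frequencies.
    freq_a, freq_b = Counter(a), Counter(b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both sources used one identical category throughout
    return (observed - expected) / (1 - expected)


# Hypothetical data (not from the article): evaluation of a political
# actor per news text, coded as -1 (negative), 0 (neutral), 1 (positive).
crowd_codes = {
    "text_1": [1, 1, 0],
    "text_2": [-1, -1, -1],
    "text_3": [0, 1, 0],
    "text_4": [-1, 0, -1],
}
manual_codes = {"text_1": 1, "text_2": -1, "text_3": 0, "text_4": 0}

texts = sorted(crowd_codes)
crowd = [majority_vote(crowd_codes[t]) for t in texts]
manual = [manual_codes[t] for t in texts]
kappa = cohen_kappa(crowd, manual)
print(f"crowd vs. manual kappa: {kappa:.3f}")
```

In this toy data the aggregated crowd code matches the manual code on three of four texts. For ordinal or interval answer formats, such as the scale variations examined in Study 2, a coefficient that respects the measurement level would likely be a better fit than nominal kappa.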